SEQUENCE LISTING 



<110> Mercian Corporation 
<110> Eisai Co., Ltd 






<120> DNA related to hydroxylation of macrolide compounds 

<130> 04063PCT 

<150> JP 2003-396828 
<151> 2003-11-27 

<160> 19 

<210> 1 

<211> 3793 

<212> DNA 

<213> Streptomyces sp. 

<220> 
<221> CDS 

<222> (1322)..(2548) 

<220> 
<221> CDS 

<222> (2564)..(2761) 
<400> 1 

ctgcagctcg acgtgcgggt cggacttcac gttgaagtac cagaccggat gcttgggcgc 60
accgcccagc gaggcgaccg ccgcgtaact cccctcgtgc tcgacccgca tcagcggcgt 120
cttgcggatc tttccgctgc gcgcgccccg ggtggtgagc acgatgaccg gcagcccggt 180
gtcccgcagc gtggtgccct tggtgccccc ggaactctcg tacagctcga cctgctcgcg 240
cacccactgc gtcgggctgg gctcgtactc gccctcaagt ggcaagggat ccgtctcctt 300
cgtcggtccg gcggatggtg ctccggacgg tcccaactcc cgcggccgcc cggatcatcc 360
gtaccgcatg ccccttcgcc cgagcgggtg atcaccgttc cggccatccg gtcgtccgca 420
ccgcgagcac caggatcacg gcgctggaga gcagggccgt gaccagccgc ccccggtggc 480
ccgtcagggc gcgacccagc agcgcgcccc cgcccgccag cagtagctgc cagctcgcgg 540
acgcggcgaa ggccgccgcg gcgaacaccg cccgtttcag cggccgtgcc gcaccggcgg 600
cgccgctgcc gagcaccagc gccacgaagt agaccaccgt catgggattg agcagggtga 660
tcccgagaag gccgagataa gcccctgccg cgcctggaac cggccgttcc gggcgggtgg 720
tgagccgatg ggcgcggtac tgccgcaggg cgagcagcgc cgcccgcagc gcgagcaccg 780
cgaggaccag cgccgaggcc cagcgcagcg ggtccagcac cggccgcagc tgtgccgcga 840
gggcggcgcc gcccacggtc gcgagcagcg cgtacagccc gtcggccgtg gcgacgccga 900
gcgccgccga ggcgccggtg cgcagcgagg tgcgggcggt gagggagacc agataggtcc 960
cgaccgcgcc gacgggcacc gcgatgccgt acccggcgag caggcccgcg agcagcgcgc 1020
ccgtcacggg cgtgcgggac tggttcctcc ggggacggcg gggctgctgt cggcccggca 1080
ccgcgggggc ggtggcagcg ggcgtcggca ggagggaggc tgtaggaggc atgggccgat 1140
cctggggccg ccgcgcccgc accggcaaat gaattacggc gcgttccagc ccccggccgg 1200
ctcgctcttc ggccacttca ccgcgtacgg cgatctggcc gaacttgctg tcgccccata 1260
ggtgcctcgg gcatctaatg aagatcggca cgacgcacct cttcgtctgc gaggtctttc 1320
c atg acg gaa ctg acg gac atc acc ggc ccg ggg acc ccg gcc gaa 1366
Met Thr Glu Leu Thr Asp Ile Thr Gly Pro Gly Thr Pro Ala Glu 
1 5 10 15 

ccc gtc gca ttc ccc cag gac cgc acc tgc ccc tac cac ccc ccc acc 1414
Pro Val Ala Phe Pro Gln Asp Arg Thr Cys Pro Tyr His Pro Pro Thr 

20 25 30 

gga tac ggc ccg ctg cgc gac ggg cgc agc ctg tcc cgc gtc acc ctc 1462
Gly Tyr Gly Pro Leu Arg Asp Gly Arg Ser Leu Ser Arg Val Thr Leu 

35 40 45 

ttc gac ggc cgc gag gtc tgg atg gtc acg ggc cac gcc acc gcc cgc 1510
Phe Asp Gly Arg Glu Val Trp Met Val Thr Gly His Ala Thr Ala Arg 







50 










55 








gcg 


ctg 


etc 


gcg 


gac 


ccc 


egg 


ctg 


tec 


ace 


gac 


Ala 


Leu 


Leu 


Ala 


Asp 


Pro 


Arg 


Leu 


Ser 


Thr 


Asp 




65 










70 








ttc 


ccc 


gtg 


ccc 


acg 


gec 


cgc 


ttc 


gcg 


gee 


gtc 


Phe 


Pro 


Val 


Pro 


Thr 


Ala 


Arg 


Phe 


Ala 


Ala 


Val 


80 










85 










90 


gcg 


ctg 


etc 


ggc 


gtg 


gac 


gac 


ccg 


gtc 


cac 


cag 


Ala 


Leu 


Leu 


Gly 


Val 


Asp 


Asp 


Pro 


Val 


His 


Gin 










100 










105 




a tg 


ate 


ccg 


teg 


ttc 


ace 


etc 


aag 


cgc 


gcg 


gee 


Met 


1 le 


Pro 


Ser 


Phe 


Thr 


Leu 


Lys 


Arg 


Ala 


Ala 








115 










120 






ate 


cag 


egg 


ace 


gtc 


gac 


ggg 


ctg 


ctg 


gac 


RCR 
0 *-» 0 


1 le 


Gin 


Arg 


Thr 


Val 


Asp 


Gly 


Leu 


Leu 


Asp 


Ala 






130 










135 






ccg 


ccg 


gec 


gag 


ctg 


gtc 


tec 


gee 


ttc 


gee 


r t r 


Pro 


Pro 


Ala 


Glu 


Leu 


Val 


Ser 


Ala 


Phe 


Ala 


1 PI 1 




145 










150 










gt c 


ate 


t EC 


RRC 


r t r 


etc 


rrt 


R t R 
5 l 5 


r r e 


tar 


6LL 


Va 1 


1 1 e 


Cys 


GI v 


Leu 


1 PIJ 

1— c u 


GI v 

u » y 


Va I 

v a 1 


Prn 


T v r 
1 y 1 


A 1 a 

r\ 1 a 


160 










165 










170 


RaR 

o « o 


gaa 


cag 


tec 


cgc 


are 


r t R 


r t R 


r Rr 

L 0 L 


RR t 
65 l 


L L L 


G I u 


Glu 


Gin 


Ser 


A r p 


Thr 


1 pi 1 


1 PI 1 


A r p 


R 1 v 
vj 1 y 


Pro 










180 










1 8S 




caa 


RRR 

ooo 


Rr r 


cgc 


R3 R 
B a B 


r rp 


r t r 


pa p 

BOB 


pap 
e« b 


tar 

l a L 


c t r 
III 


Gin 

KJ 1 1 l 


GI v 

vj i y 


Al a 


A r p 

r\ i e 


Rl u 

VJ 1 u 


A r p 

r\ 1 e 


1 Oil 


RI11 

VJ 1 u 


Rl 11 
VJ 1 u 


Ty r 


Leu 








1 95 










?nn 

L.UU 






ear 


aag 


Ra R 


r rr 

L&& 


rap 

L a g 


or r 
5 V* L 


pa a 
B " 0 


r r r 

ILL 


ppt 


par 
5 0 L 


oar 
65L 


nop 




Rl u 


A r p 

r\ 1 S 


Rl n 

VJ 1 1 1 


A 1 a 


Rl 11 

VJ I u 


P rn 
r r u 


R 1 v 


Asp 


vj 1 y 






21 0 










21 5 






r t r 


ca c 


rag 


TRR 


rte 

Lie 


r er 


a r r 

OIL 


ppt 


pa p 


r t p 

L L & 


pa r 


Va 1 


His 


Gin 


Arg 


1 PIJ 

1— c u 


A r p 

#i l e 


Thr 
1 1 1 1 


Rl v 
vj 1 y 


Rl u 
vj r u 


1 P 1 1 


Asp 




225 










230 








ECR 


ctg 


gec 


gt c 


ate 


ctg 


r t r 
tit 


R t R 


er r 

O V-» L 


eee 


par 
L a L 


Ala 


Leu 


Ala 


Va I 


1 1 e 


Leu 


1 pij 

l— c u 


Va 1 


Al a 


R 1 v 
u 1 y 


H i <: 


240 










245 










a tg 


ate 


tec 


etc 


RRC 
oo 


ace 


t a c 


acg 


ctg 


r t r 
lib 


r rr 
Lee 


Met 


1 1 e 


Ser 


Leu 


GI v 

VJ 1 J 


Thr 


T v r 
• y 1 


Thr 

1 1 i 1 


1 Pll 

l— c u 


Leu 


A r p 

r\ 1 e 










260 










265 


gec 


gag 


ctg 


cgc 


gec 


gac 


ccg 


Rr r 


ctg 


r t r 

L l 0 


rrr 

L L L 


Ala 


Glu 


Leu 


Arg 


Ala 


Asp 


Pro 


Ala 


Leu 


1 PIJ 


Pro 








275 










280 






c tg 


a tg 


egg 


a tg 


etc 


teg 


ate 


RCR 


gac 


RRR 
000 


ctg 


Leu 


Met 


Arg 


Met 


Leu 


Ser 


1 le 


Ala 


Asp 


Gly 


Leu 






290 










295 






gag 


gac 


ate 


gag 


ate 


gec 


ggc 


gee 


acg 


ate 


CRR 

*■» 00 


Glu 


Asp 


1 1 e 


Glu 


Me 


Ala 


Gly 


Ala 


Thr 


1 le 


Arg 




305 










310 








ctg 


ttc 


tec 


ace 


teg 


ctg 


ate 


aac 


cgc 


gac 


gag 


Leu 


Phe 


Ser 


Thr 


Ser 


Leu 


Me 


Asn 


Arg 


Asp 


Glu 


320 










325 










330 


ccc 


gac 


ace 


ctg 


gac 


ttc 


cac 


cgc 


tec 


acc 


cgc 


Pro Asp 


Thr 


Leu 


Asp 


Phe 


His 


Arg 


Ser 


Thr 


Arg 










340 










345 


ggt 


ttc 


ggc 


ate 


cac 


cag 


tgc 


ctg 


ggc 


cag 


aac 


Gly 


Phe 


Gly 


Me 


His 


Gin 


Cys 


Leu 


Gly 


Gin 


Asn 








355 










360 






ctg gag 


ate 


gec 


ctg 


ggc 


acg 


etc 


ctg 


gag 


egg 


Leu 


Glu 


Me 


Ala 


Leu 


Gly 


Thr 


Leu 


Leu 


Glu 


Arg 






370 










375 







acc cag cgg 
Thr Gln Arg 



60 

cgc acc ctc ccg ggc 
Arg Thr Leu Pro Gly 
75 

cgc gac cgg cgg gtg 
Arg Asp Arg Arg Val 
95 

cgg atg 
Arg Met 
110 

ggg ctg cgg ccc acc 
Gly Leu Arg Pro Thr 
125 

atg atc gag aag ggg 
Met Ile Glu Lys Gly 
140 

ccc gtg ccc tcg gtg 
Pro Val Pro Ser Val 
155 

gac cac gag ttc ttc 
Asp His Glu Phe Phe 
175 
gac tcg 



acg gcc gcc 
Thr Ala Ala 



Asp Ser 
190 

ggc ggg ctg atc gac 
Gly Gly Leu Ile Asp 
205 

gtc ctg gac gac ctc 
Val Leu Asp Asp Leu 
220 

cgg cgc gac gtg gtg 
Arg Arg Asp Val Val 
235 

gag acg acc gcc aac 
Glu Thr Thr Ala Asn 
255 
cgg ctg 
Arg Leu 
270 

gcc gcc gtg gag gag 
Ala Ala Val Glu Glu 
285 

ctg cgc ctg gcc ctg 
Leu Arg Leu Ala Leu 
300 

gcc ggc gag ggc gtc 
Ala Gly Glu Gly Val 
315 

tcc gtg ttc 
Ser Val Phe 



cac ccc ggc 
His Pro Gly 



cac cac gtg 
His His Val 



gac gac 
Asp Asp 
335 
gcc ttc 
Ala Phe 
350 

ctg gcc cgc gcc gag 
Leu Ala Arg Ala Glu 
365 

ctc ccc ggc ctc cgg 
Leu Pro Gly Leu Arg 
380 



1558 

1606 

1654 

1702 

1750 

1798 

1846 

1894 

1942 

1990 

2038 

2086 

2134 

2182 

2230 

2278 

2326 

2374 

2422 

2470 
ctg gcc gcg ccc gcc gag gag atc ccg ttc aaa ccc ggc gac acg atc 2518 
Leu Ala Ala Pro Ala Glu Glu Ile Pro Phe Lys Pro Gly Asp Thr Ile 

385 390 395 

cag ggg atg ctg gaa ctc ccc gtg acc tgg taa gaggctctgg tc atg cac 2569 
Gln Gly Met Leu Glu Leu Pro Val Thr Trp Met His 

400 405 410 

atc gac atc gac aag gac cgc tgc atc ggc gcc ggc cag tgc gcg ctg 2617 
Ile Asp Ile Asp Lys Asp Arg Cys Ile Gly Ala Gly Gln Cys Ala Leu 

415 420 425 

gcc gcc ccg ggc gtg ttc acc cag gac gac gac ggc tac agc acc ctg 2665 
Ala Ala Pro Gly Val Phe Thr Gln Asp Asp Asp Gly Tyr Ser Thr Leu 

430 435 440 

ctc ccc ggc cgc gag gac ggc ggg ggc gac ccg atg gtc cgg gag gcg 2713 
Leu Pro Gly Arg Glu Asp Gly Gly Gly Asp Pro Met Val Arg Glu Ala 

445 450 455 

gcc cgc gcc tgc ccg gtg agc gcc atc cgg gtg acc gaa ccg gcc ggc 2761 
Ala Arg Ala Cys Pro Val Ser Ala Ile Arg Val Thr Glu Pro Ala Gly 
460 465 470 475 

tga ggcggggccc ggcggccgcg gcccgctgcc gggaccgccg ttcccagttc agtagg 2820 



gtcgtgcgat gacctcacag gccgggaagc ccttcctcta cgtcgtcgtc tgcgcggccg 2880
ggaccgccgc cggagtcacc acgctgatcg gcgccgccca ggcgcggggc tgggaggtgg 2940
gggtcctggc cacgccggtg gcgatgggcg ggttcttcga cacggctgcg gtcgaggaga 3000
tgacgggccg gcccatccgc tcggcctggc gctcgccggc cgatccgcgc ccgttcccgc 3060
cgccgggcgc cgtggtggtg gcgcccgcca ccttcaacac cgtcaacaag tgggcggccg 3120
gtctcgccga cacgctcgcc gtcggcacgc tctgcgaggc ggcgggcctc ggcgtgccga 3180
tcgccgtcct gccctgcgtg gcggacgcgc tggccgccca ccccgcgtac cgggagggcc 3240
ttctccggct gcgtgggatg ggcgtccgct tcggcgagcc gtacgccggc ccgccggggg 3300
aggacggcga ggcggacggc gcacggcccg ggttcgcctg ggagaacgcc ctggacctgc 3360
tggagcgggc ctgaacccgc tccccgaccc gtagggcctg tctgacactg tcagacaggc 3420
cctaacggca ggtcagcgcc ggcccggcca gcatgccgcc ggtgtagagg tcctggcccc 3480
gcggcagcca gtagcccagc ctggagacca ccgtggagca gtcaggcccg acggtgacgc 3540
ggaccttcac cgtctcggga cggccgggct gcagcgcggt cagcgcgcag tccagggagt 3600
acgcgagccg ggtcttcgag gtaccggccg accagcgggt tgacgcagcg ggcgtcgtcc 3660
gtggcgatcc gcaccccggt gcccgccccg ccgatgagtc cgagccgggc gctgccgtcg 3720
ccctcgtcgc tgtcccggcg gaccgtgtag gtcagcgtgg tggtggcgtc gcgccgcagg 3780
gtgtccggtc gac 3793



<210> 2 
<211> 2329 
<212> DNA 

<213> Streptomyces sp. 

<220> 
<221> CDS 

<222> (420)..(1604) 



<220> 
<221> CDS 
<222> (1643)..(1834) 



<400> 2 

ggatccacgg gtggccgccg cgctcgcccg ggtgaccgac cggcgtatcg gctatgtcgc 60 

cgcgctcttc gcggcgctgg gcttccccga gggcgaggcg cgggaccgcg gcctgctggc 120 

gtacaccgcc tacctcggcc acacccagct cggacatgcc gtccgacaga gcctgccggc 180 

cgaggcggca cacgaccgct atctggatgg cgtgatcgac accctcgtac ggccgcggga 240 

cggaggcgat gaagccgaac atgtcacaat ctgaacgagg ttggcggaac tgcgcgcaga 300 

acatgcccgg tatccgcggc atgaggtgag atcggcgcgg cgaaacacgg tgcgccacag 360 
cgttgccatc tcacacacga gcaactcgag ccacttgaga ctcgtacggg aggaaattc 419 

gtg acc gaa gcc atc ccc tac ttt cag aac cgc acc tgt ccc tac cac 467 

Val Thr Glu Ala Ile Pro Tyr Phe Gln Asn Arg Thr Cys Pro Tyr His 

1 5 10 15 

ccg ccc gcc gcc tat cag cca ctg cgc ggg gcc ggc ccg ctg agc cat 515 

Pro Pro Ala Ala Tyr Gln Pro Leu Arg Gly Ala Gly Pro Leu Ser His 

20 25 30 

gtc acg ttc tac gac ggc cgg aag gtg tgg gcg gtc acc ggc cac ccc 563 

Val Thr Phe Tyr Asp Gly Arg Lys Val Trp Ala Val Thr Gly His Pro 

35 40 45 

gag gca cgg gcg ctg ctg acc gac cag cga ctc tcc gcc gac cgg cag 611 

Glu Ala Arg Ala Leu Leu Thr Asp Gln Arg Leu Ser Ala Asp Arg Gln 

50 55 60 

aac ccg gcc ttc ccg gtc ccc ttc gaa cgc ttc gcg gcc atc cgc cgg 659 

Asn Pro Ala Phe Pro Val Pro Phe Glu Arg Phe Ala Ala Ile Arg Arg 

65 70 75 80 

gtc cgg acg ccg ctg atc ggg gtc gac gac ccg gag cac aac acc cag 707 

Val Arg Thr Pro Leu Ile Gly Val Asp Asp Pro Glu His Asn Thr Gln 

85 90 95 

cgc cgg atg ctg atc ccc agc ttc agc ctc aag cgg acc gcc gca ctg 755 

Arg Arg Met Leu Ile Pro Ser Phe Ser Leu Lys Arg Thr Ala Ala Leu 

100 105 110 

cgg ccg gag atc cag cgg atc gtc gac ggg ctg ctc gac cgg atg ctg 803 

Arg Pro Glu Ile Gln Arg Ile Val Asp Gly Leu Leu Asp Arg Met Leu 

115 120 125 

gat cag ggc ccg ccc acc gag ctg gtc tcc gcg ttc gcc ctg ccg gtc 851 

Asp Gln Gly Pro Pro Thr Glu Leu Val Ser Ala Phe Ala Leu Pro Val 

130 135 140 

ccg tcg atg gtg atc tgc gca ctg ctc gga gtc tca tac gcc gac cat 899 

Pro Ser Met Val Ile Cys Ala Leu Leu Gly Val Ser Tyr Ala Asp His 

145 150 155 160 

gag ttc ttc gag gag gag tcc cgc cgc atc ctg cgc ggc cgg tcg gcc 947 

Glu Phe Phe Glu Glu Glu Ser Arg Arg Ile Leu Arg Gly Arg Ser Ala 

165 170 175 

gag gag gcg gag gac gcc cgg ctg aag ctg gag gag tac ttc acc ggg 995 

Glu Glu Ala Glu Asp Ala Arg Leu Lys Leu Glu Glu Tyr Phe Thr Gly 

180 185 190 

ctg atc gcc gcc aag gag aag aac ccg ggc gac ggg ctg ctg gac gag 1043 

Leu Ile Ala Ala Lys Glu Lys Asn Pro Gly Asp Gly Leu Leu Asp Glu 

195 200 205 

ctg atc gag gac cgg ctg cgg acc ggc gcg ctc acc cgc gac gag ctg 1091 

Leu Ile Glu Asp Arg Leu Arg Thr Gly Ala Leu Thr Arg Asp Glu Leu 

210 215 220 

gtc cgg ctc gcc atg atc ctg ctg gtg gcc ggc cat gag acc acc gcc 1139 

Val Arg Leu Ala Met Ile Leu Leu Val Ala Gly His Glu Thr Thr Ala 

225 230 235 240 

aac atg atc tcg ctc ggc acc ttc acc ctg ctg gac cac ccc gag cag 1187 

Asn Met Ile Ser Leu Gly Thr Phe Thr Leu Leu Asp His Pro Glu Gln 

245 250 255 

ctg gcg cag ctc aag gcc gac gag ggc ctg atg ccg gcc gcc atc gag 1235 

Leu Ala Gln Leu Lys Ala Asp Glu Gly Leu Met Pro Ala Ala Ile Glu 

260 265 270 

gag ctg ctg cga ttc ctg tcc atc gcg gac ggc ctg ctg cgg gtg gcg 1283 

Glu Leu Leu Arg Phe Leu Ser Ile Ala Asp Gly Leu Leu Arg Val Ala 

275 280 285 

acg gag gac atc gag atc ggc ggt cag gtg atc cgg gcc gac gac gcg 1331 

Thr Glu Asp Ile Glu Ile Gly Gly Gln Val Ile Arg Ala Asp Asp Ala 

290 295 300 

gtc ctg ttc ccc gcc tca ctg atc aac cgg gac gag gcc gcc tat ccg 1379 

Val Leu Phe Pro Ala Ser Leu Ile Asn Arg Asp Glu Ala Ala Tyr Pro 
305 310 315 320 

gca ccc gac gag ctg gac ctc ggc cgt tcg gcc cgc cat cac gtg gcg 1427
Ala Pro Asp Glu Leu Asp Leu Gly Arg Ser Ala Arg His His Val Ala 

325 330 335 

tcc ggc ttc ggg atc cac cag tgc ctg ggg cag aac ctc gcc cgc gcg 1475
Ser Gly Phe Gly Ile His Gln Cys Leu Gly Gln Asn Leu Ala Arg Ala 

340 345 350 

gag atg gag atc gcg ctg cgc tca ctg ttc acc agg atc ccg cag ctg 1523
Glu Met Glu Ile Ala Leu Arg Ser Leu Phe Thr Arg Ile Pro Gln Leu 

355 360 365 

cgg ctc gcc gtg ccg gcc gcc gag att ccg ttc aag gac gga gac acc 1571
Arg Leu Ala Val Pro Ala Ala Glu Ile Pro Phe Lys Asp Gly Asp Thr 

370 375 380 

ctg caa ggc atg atc gaa ctg ccg ctg gcc tgg tag cagccaggac ggcaga 1623
Leu Gln Gly Met Ile Glu Leu Pro Leu Ala Trp 
385 390 395 

ccaaagaaag gggtccgga atg cgg atc gcg atc gac acc gac cgc tgt atc 1675

Met Arg Ile Ala Ile Asp Thr Asp Arg Cys Ile 
400 405 
ggc gcc ggc cag tgt gcc ctg acc gcg ccc ggg ggt ttc acc cag gat 1723
Gly Ala Gly Gln Cys Ala Leu Thr Ala Pro Gly Gly Phe Thr Gln Asp 

410 415 420 

gac gac ggt ttc agt gca ctg ctg ccc ggc cgg gag gac ggc gcc ggc 1771
Asp Asp Gly Phe Ser Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly 

425 430 435 

gac ccg ctg gtg cgg gaa gcc gcc cgc gcc tgc ccc gtg cag gcc att 1819
Asp Pro Leu Val Arg Glu Ala Ala Arg Ala Cys Pro Val Gln Ala Ile 

440 445 450 

gcg gtc acc gac gat tag cagcaccccc gcggacgacc cggcagacgc gcgcggcc 1875
Ala Val Thr Asp Asp 
455 

ccggctgaca cccggcgccc gaggcgcgcc cgagccgtcc gcccctccac ttgtccctac 1935
ggcatccacc ccatccgcta ccgcaacacc ccttgggtga cgggcagttt cgaggacccc 1995
ggtgtgcccg gggcgtactg gtgaccgtca ccggcttcac gccgcgattg cccacatagg 2055
cgtcgtcgct cgcggcgatc acgaagcgcg gtcggtgccc cggctcgtaa cggtgcacga 2115
tgcccggcag ttccacggtg aaccgccggg ccacatcggg cacccgggcc ggggccacca 2175
acaggtgcac cagcgtcttc ctgccgttcg gcgcgacatc gtagagcttg gcgaacagca 2235
ccagcttgtc cgccgcatcc gcggaccgct gcgcccgccc ggcctgcggc gaggcaacct 2295
tcagcgtcac cctcggcgcg cccaccacgt cgac 2329



<210> 3 
<211> 1860 
<212> DNA 
<213> Unknown 

<220> 
<221> CDS 

<222> (172)..(1383) 
<223> A-1560 strain 



<220> 
<221> CDS 

<222> (1399)..(1593) 
<400> 3 

cggggatcgt acgccgtacc gtttcggggc aaccgaatta cgatgcggaa tggatggttc 60
ccagccagat cccgcaggta gccgatctgg ccgaacttga tgtcgtgcac tggatgcctc 120
gggcatctaa tgaagatcgg cacgacgcat ccttcgtctg cgaggtctcc c atg aca 177
Met Thr 
1 

gac acg aca gac ctg acc gag ctg tca gat ccc gtc tcc ttc ccc cag 225
Asp Thr Thr Asp Leu Thr Glu Leu Ser Asp Pro Val Ser Phe Pro Gln 
5 10 15 

gac cgg agc tgc ccc tac cac ccg ccc acc ggg tac gac ccg ctg cgc 273
Asp Arg Ser Cys Pro Tyr His Pro Pro Thr Gly Tyr Asp Pro Leu Arg 
20 25 30 

acc gaa cgg ccg ccc gcc cgc atc cgg ctc tac gac ggc cgc ccc gcc 321
Thr Glu Arg Pro Pro Ala Arg Ile Arg Leu Tyr Asp Gly Arg Pro Ala 
35 40 45 50 

tgg ctc gtc acc ggc cac gcc gtc gcc cgt gac ctg ctg gtc gac ccc 369
Trp Leu Val Thr Gly His Ala Val Ala Arg Asp Leu Leu Val Asp Pro 
55 60 65 

cgc ctg tcc acg gac cgc acc cgc tcg ggc ttc ccg gcc aca act ccc 417
Arg Leu Ser Thr Asp Arg Thr Arg Ser Gly Phe Pro Ala Thr Thr Pro 
70 75 80 

cgc ttc gcc gcg gtc cgc gac cgc aag ccg gcg ctc ctc ggc gtc gac 465
Arg Phe Ala Ala Val Arg Asp Arg Lys Pro Ala Leu Leu Gly Val Asp 
85 90 95 

gac ccc aag cac cgc acc cag cgg tgg atg atg atc ccg agc ttc acc 513
Asp Pro Lys His Arg Thr Gln Arg Trp Met Met Ile Pro Ser Phe Thr 
100 105 110 

ctc agg cgc gcc acc gag ctc agg ccg cgc atc cag gag atc gtc gac 561
Leu Arg Arg Ala Thr Glu Leu Arg Pro Arg Ile Gln Glu Ile Val Asp 
115 120 125 130 

gaa ctg ctg gac gtg atg atc gcc cag gga ccc ccg gcc gac ctg gtg 609
Glu Leu Leu Asp Val Met Ile Ala Gln Gly Pro Pro Ala Asp Leu Val 
135 140 145 

cgt tcc ttc gcg ctg ccg gtg ccg tcc atg gtg atc tgc gcc ctg ctc 657
Arg Ser Phe Ala Leu Pro Val Pro Ser Met Val Ile Cys Ala Leu Leu 
150 155 160 

ggc gtg ccc tac gcc gac cac gag ttc ttc gag gac cag tcc agg cgg 705
Gly Val Pro Tyr Ala Asp His Glu Phe Phe Glu Asp Gln Ser Arg Arg 
165 170 175 

ctg ctg cgc gga ccg gcg gcc gag gac acg cag gac gcc cgg gac cgg 753
Leu Leu Arg Gly Pro Ala Ala Glu Asp Thr Gln Asp Ala Arg Asp Arg 
180 185 190 

ctc gcc gcg tac ctg gag gac ctg atc gac gag aag cgg cgc cgg ccc 801
Leu Ala Ala Tyr Leu Glu Asp Leu Ile Asp Glu Lys Arg Arg Arg Pro 
195 200 205 210 

ggt gac ggc ctg ctg gac gaa ctc gtc cag cag cgt ctg aac gaa ggc 849
Gly Asp Gly Leu Leu Asp Glu Leu Val Gln Gln Arg Leu Asn Glu Gly 
215 220 225 

gag ctc gac cgg gag gaa ctg acc gcg ctg gcg atg atc ctg ctg gtc 897
Glu Leu Asp Arg Glu Glu Leu Thr Ala Leu Ala Met Ile Leu Leu Val 
230 235 240 

gcg ggc cac gag acc acc gcc aac atg atc tcc ctg ggc acc tac acg 945
Ala Gly His Glu Thr Thr Ala Asn Met Ile Ser Leu Gly Thr Tyr Thr 
245 250 255 

ctc ctg ctg cac ccc gaa cgg ctg acc gag ctg cgc gcc gac ccc gcg 993
Leu Leu Leu His Pro Glu Arg Leu Thr Glu Leu Arg Ala Asp Pro Ala 
260 265 270 

ctg ctg ccg gcc gcc gtc gag gaa ctg atg cgg atg ctg tcc atc gcg 1041
Leu Leu Pro Ala Ala Val Glu Glu Leu Met Arg Met Leu Ser Ile Ala 
275 280 285 290 

gac gga ctg ctg cgg cag gcc acc gag gac atc gag atc gcc ggg acc 1089
Asp Gly Leu Leu Arg Gln Ala Thr Glu Asp Ile Glu Ile Ala Gly Thr 
295 300 305 

acc atc agg gcc ggg gac ggc gtg gtc ttc tcc acc tct gtc atc aac 1137
Thr Ile Arg Ala Gly Asp Gly Val Val Phe Ser Thr Ser Val Ile Asn 
310 315 320 

cgc gac gag gac gtc tac ccg gcc ccc gac acc ctc gac ttc cac cgc 1185
Arg Asp Glu Asp Val Tyr Pro Ala Pro Asp Thr Leu Asp Phe His Arg 

325 330 335 

tcg acc cgc cac cac gtc gcc ttc ggt ttc gga atc cac cag tgc ctc 1233 
Ser Thr Arg His His Val Ala Phe Gly Phe Gly Ile His Gln Cys Leu 

340 345 350 

ggc cag aac ctc gcc cgc acc gaa ctg gag atc gcc ctg cgc acg ctc 1281 
Gly Gln Asn Leu Ala Arg Thr Glu Leu Glu Ile Ala Leu Arg Thr Leu 
355 360 365 370 

ctc gaa cgg ctg ccc acg ctc cgg ctc gcc gcc cca ccg gag gaa atc 1329 
Leu Glu Arg Leu Pro Thr Leu Arg Leu Ala Ala Pro Pro Glu Glu Ile 

375 380 385 

ccc ttc aaa ccc ggc gac acc atc cag ggg atg ctg gaa ctc ccc gtc 1377 
Pro Phe Lys Pro Gly Asp Thr Ile Gln Gly Met Leu Glu Leu Pro Val 

390 395 400 

agc tgg taa gaggctgccg tc atg cat atc gag atc gac aag gac cgc tgc 1428 
Ser Trp Met His Ile Glu Ile Asp Lys Asp Arg Cys 

405 410 
atc ggc gcc gga cag tgc gcc ctg acc gcc ccg ggt gtg ttc acc cag 1476 
Ile Gly Ala Gly Gln Cys Ala Leu Thr Ala Pro Gly Val Phe Thr Gln 

415 420 425 430 

gac gac gac ggc ttc agt gac ctg ttg ccc ggc cgg gag gac ggc gcc 1524 
Asp Asp Asp Gly Phe Ser Asp Leu Leu Pro Gly Arg Glu Asp Gly Ala 
435 440 445 

ggc gac ccg atg gtc cgg gag gcc gcc agg gcc tgc ccc gtg agt gcc 1572 
Gly Asp Pro Met Val Arg Glu Ala Ala Arg Ala Cys Pro Val Ser Ala 

450 455 460 

atc acg ctg tcc gag gac ggg tag ggggccgagc cgcgccgccc gccggtccgc 1626 
Ile Thr Leu Ser Glu Asp Gly 
465 

tgccgcggcg ccgtgccgac gcggcggccg gccggcccgt ccggtgcccg tcgcgtcgcc 1686 
ccgtggcccc ggcggcggct gattgactag ggttcccggg tgagcgaaca ggcccagaag 1746 
ccctccgggg cgccgcccgc gaaagacacc gggacggcgc ccgggaaacc ccttcctcta 1806 
cgtcgtcgtc tgcgccgccg gcatcgccga aggcgtcagc aagctgatca ccgc 1860 

<210> 4 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 5Dm-3F Primer 

<400> 4 

ttcgcsctsc csgtcccstc satggtsat 29 

<210> 5 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 




<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 5Dm-3R Primer 
<400> 5 

gttgatsays gasgtsgaga a 21 



<210> 6 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 6PIN-2F Primer 

<400> 6 

gctgcgcctg gccctggagg acatcgagat 30 



<210> 7 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 6PIN-2R Primer 

<400> 7 

ctgttcctcg aagaactcgt ggtcggcgta 30 



<210> 8 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : DM-NdeF Primer 

<400> 8 




gcccccatat gacggaactg acggacatca 30 



<210> 9 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : DM-SpeR Primer 

<400> 9 

gggccactag tcagccggcc ggttcggtca 30 



<210> 10 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : DM-BglF Primer 

<400> 10 

cgcatagatc ttcacccgag cgggtgatca 30 



<210> 11 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : DM-BglR Primer 

<400> 11 

tcccgagatc ttgaaggtcc gcgtcaccgt 30 



<210> 12 
<211> 22 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 50-1 R Primer 

<400> 12 

aggtgcccag cgagatcatg tt 22 



<210> 13 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 7PIN-2F Primer 

<400> 13 

ccatgatcct gctggtggcc ggccatgaga 30 



<210> 14 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 07-NdeF Primer 

<400> 14 

gccccatatg accgaagcca tcccctactt 30 



<210> 15 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 07-SpeR Primer 
<400> 15 

gccactagtg ctaatcgtcg gtgaccgcaa 30 



<210> 16 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 5Dm-2R Primer 

<400> 16 
ctggatsgtg tcsccsggyt t 21 



<210> 17 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : 5PIN-2F Primer 

<400> 17 

cggaatccac cagtgcctcg gccagaacct 30 



<210> 18 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : tpm-NdeF Primer 

<400> 18 

ggccccatat gacagacacg acagacctga 30 

<210> 19 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> STRANDNESS : single 
<220> 

<223> TOPOLOGY : linear 
<220> 

<223> Description of Artificial Sequence : tpm-SpeR Primer 

<400> 19 

gcgcgactag tccccctacc cgtcctcgga 30 